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CHIMERIC MUTATIONAL VECTORS HAVING NON- NATURAL NUCLEOTIDES 

1 . FIELD OF THE INVENTION 

The invention concerns compounds and methods of their 
use to make specific genetic alterations in the genome of 
target eukaryotic cells. More specifically, the invention 
concerns the introduction into the nucleus of the target cell 
of an oligonucleobase compound, termed a Chimeric Mutational 
Vector (CMV) , having a sequence that has regions of homology 
and one or more differences with a gene of the target cell 
that IS to be altered (the "target gene"). The structure of 
the CMV is designed so that genetic recombination between the 
CMV and the target gene occurs, i.e., the sequence of the CMV 
replaces the sequence of the target gene. 

2 . BACKGROUND OF THE INVENTION 

2.1. Site-Directed Genetic Alteration in Eukaryctic Cells 

Those skilled in the art of molecular biology recognize 
that on frequent occasions it is desired not merely to 
introduce a new polynucleic acid sequence, i.e, a new gene, 
into a targeted eukaryotic cell, but rather to alter a 
defined, pre-existing gene in the targeted cell. The 
targeted cell can be used in culture or it can be used to 
construct a transgenic animal. 

A wide variety of techniques have been developed to 
introduce DNA into cultured eukaryotic cells. These 
techniques include calcium phosphate precipitation and DEAE- 
dextran mediated endocytosis, electroporation, liposome 
mediated fusion and transduction with replication incompetent 
viruses. However, while such techniques can quite often 
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introduce functional genes into the eukaryotic cell, these 
techniques do not readily accomplish an alteration (mutation) 
m a specific existing gene. After introduction the exogenous 
DNA IS isolated at a random position in the cell's genome by 
illegitimate recombination, rather than at a specific 
position by homologous reconribinat ion . 

Prior to the present invention, there was no generally 
satisfactory scheme for introducing a site-specific genetic 
alteration in a higher eukaryote, i.e, m mammalian or avian 
cells. Although homologous recomlDinat ion can be obtained in 
higher eukaryotic cells by introduction of very long {> 1 kb) 
nucleic acids, these techniques require the application of 
elaborate selection techniques because the rate of 
illegitimate recombination m higher eukaryotes greatly 
exceeds that of homologous recombination. Thomas, K.R. & 
Capecchi, M.R., 1987, Cell 52:503. See, also, Valancius, V. Sc 
Smithies O., 1991, Mol . Cell. Biol. 11:4389 (comparison 
homologous recombination of linearized and supercoiled 
plasnids in eukaryotic cells) . 

One approach to achieving a predominantly site-directed 
mutagenesis has been the introduction of single stranded 
oligodeoxynucleotides directly into the cell. This 
techniques has been successfully employed in the yeast 
Saccharomyces cerevisiae , in which homologous recombination 
is significantly more active than it is m higher eukaryotes. 
Moerschell, R.P., et al . , 1988, Proc . Nat 1 . Acad . Sc i . 85:524- 
28; Yamamoto, T., et al . , 1992, Yeast 8:935-48. However, to 
date there have been no reports of the successful 
transformation of mammalian or avian cells by single stranded 
oligonucleot ides . 
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A relationship between the structure of the target DNA 
and the rate of homologous recombination in mammalian can be 
inferred by studies that show that regions of alternating 
purine and pyrimidine bases, i.e., [d (TG) 3o*ci ( AC) 30] , display 
an entranced rate of recombination. These effects were 
demonstrated in studies of non-replicating plasmids in 
cultured mammalian cells. Wahls, W.P., et al . , 1990, Mol , 
Cell. Biol. 10:785-93, These experiments were not extended 
to show recombination between an exogenous nucleic acid and 
the genome of the cell. 

Attempts have been made to use RecA, a protein that 
promotes homologous recombination in the bacteria, E. coll, 
to promote homologous recombination in eukaryotic cells. 
However, these attempts have not been clearly successful. 
For example U.S.Pat No. 4,950,599 to W. Bertling discloses a 
very low rate of site-directed mutation and no enhancement in 
the rate of homologous recombination by use of RecA in 
eukaryotic cells. Patent publications WO 93/22443 to D. 
Zarling and E. Sena, and publication 94/04032 to D.C. 
Gruenert and K. Kunzelmann both purport to correct a genetic 
defect in a cultured cell line related to cystic fibrosis. 
These publications disclose primarily experimental data that 
demonstrate the principle rather than data concerning 
examples of operative methods. Thus, to introduce 
polynucleotide/RecA complexes access to the nucleus, Zarling 
and Gruenert employ cells that were membrane -permeabili zed, 
although such cells are incapable of further growth. 
Moreover, even when RecA-promoted homologous recombination 
was asserted to have taken place in intact cells, these 
publications provide no quantitative estimates of its 
frequency. Thus, the use of prokaryotic recA has not been 
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convincingly shown to result in a rate homologous 
recombination in any viable eukaryotic cell significantly 
greater than the spontaneous rate of homologous 
recombination . 

2.2. Chimeric Oligonucleotid es Havmo DNA»RNA Base Pairs 

The inclusion of a publication or patent application m 
this section is not to be understood as an admission that the 
publication or application occurred pricr to the present 
invention or resulted from the conception of a person other 
than the inventor. 

An oligonucleotide having complementary 
deoxyribonucleot ides and ribonucleotides and containing a 
sequence homologous to a fragment of the bacteriophage 
M13mpl9, was described m Kmiec, E.B., et al . , November 1994, 
Mol . and Cell. Biol. 14:7163-7172. The oligonucleotide had a 
single contiguous segment of ribonucleotides. Kmiec et al . 
showed that the oligonucleotide was a substrate for the REC2 
homologous pairing enzyme from Ustilago maydis. 

Patent publication WO 95/15972, published June 15, 1995, 
and corresponding U.S. patent application Serial No. 
08/353,657, filed December 9, 1994, by E.B. Kmiec, described 
CMV for the introduction of genetic changes in eukaryotic 
cells. Examples in a Ustilago maydis gene and in the murine 
ras gene were reported. The latter example was designed to 
introduce a transforming mutation into the ras gene so that 
the successful mutation of the ras gene in NIH 3T3 cells 
would cause the growth of a colony of cells 

{"transformation"). The WO 95/15972 publication reported 
that the maximum rate of transformation of NIH 3T3 was less 
than 0.1 %, i.e., about 100 t ransf ormant s per 10*^ cells 
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exposed to the ras CMV. In the Ustilago waydis system the 
rate of transf ormant s was about 600 per 10^. A chimeric 
vector designed to introduce a mutation into a human bcl-2 
gene was described in Kmiec, E.B., February 1996, Seminars in 
Oncology 23:188. 

A CMV designed to repair the mutation in codon 12 of K- 
ras was described in Kmiec, E,B., December 1995, Advanced 
Drug Delivery Reviews 17:333-4 0. The CMV was tested in Capan 
2, a cell line derived from a human pancreatic 

adenocarcinoma, using LI POFECTIN"^^ to introduce the CMV into a 
the Capan 2 cells. Twenty four hours after exposure to the 
CMV, the cells were harvested and genomic DNA was extracted; 
a fragment containing codon 12 of K-ras was amplified by PCR 
and the rate of conversion estimated by hybridization with 
allele specific probes. The rate of repair was reported to 
be approximately 18%. 

A CMV designed to repair a mutation in the gene encoding 
liver/bone/kidney type alkaline phosphatase was reported in 
Yoon, K. , et al,, March 1996, Proc . Natl, Acad. Sci . 93:2071, 
The alkaline phosphatase gene was transiently introduced into 
CHO cells by a plasmid. Six hours later the CMV was 
introduced. The plasmid was recovered at 24 hours after 
introduction of the CMV and analyzed. The results showed 
that approximately 30 to 38% of the alkaline phosphatase 
genes were repaired by the CMV . 

United States Patent Application Serial No. 08/640,517, 
filed May 1, 1996, by E.B. Kmiec, A. Cole-Strauss and K. 
Yoon, discloses methods and CMV that are useful in the 
treatment of genetic diseases of hematopoietic cells, e.g., 
Sickle Cell Disease, Thalassemia and Gaucher Disease. 

- 5 - 
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3. BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 . General form of one embodiment: of a Chimeric 
Mutat lonal Vector . 

Figures 2A and 2B. Figure 2A shows the sequence and 
structure of oligonucleotide Dhl (SEQ ID No: 18) and chimeric 
oligonucleotides Chi (SEQ ID No:15), Ch2 (SEQ ID No:16)and 
Ch3 (SEQ ID No:17) . Figure 2B illustrates the relationship 
between the sequence of the chimeric vector Chi and nt 693- 
728 of the alkaline phosphatase gene (SEQ ID No:19), DNA 
nucleotides are UPPER CASE; RNA nucleotides are lower case. 
Figure 3 . The sequences of the codons 3-9 and adjacent 
dmucieot ides of codons 2 and 10 of l^'^-glcbm {nt 1-25 of SEQ 
ID No:21), f?>'^-globin (nt 1-25 of SEQ ID No ; 2 0 ) , 6-globin (SEQ 
ID No: 25), and chimeric vectors SC1-SC5 (SEQ ID Nos : 20-24). 
DNA and RNA nucleotides are indicated as m Figure 2A. 
Figure 4A and 4B. Figure 4A and 4B show the fraction of 
copies of S-globin converted from to as a function of nM 
SCI added and of 1^'^ to 15^ as a function nM of SC2 , 
respectively, in cultures of EB - transformed lymphoblast s . 
Figure 5 . Figure 5 shows the fraction of copies of l?»-globin 
converted 1^^ to as a function of ng SC2 added to cultures 
of cd34* hematopoietic stem cells. 



4 - DEFINITIONS 

The invention is to be understood in accordance with the 
following definitions . 

An q1 igonucleobase is a polymer of nucleobases, which 
polymer can hybridize by Watson-Crick base pairing to a DNA 
having the complementary sequence. 

Nucleobases comprise a base, which is a purine, 
pyrimidme, or a derivative or analog thereof. Nucleobases 
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include peptide nucleobases . the subunits of peptide nucleic 
acids, and niorpholine nucleobases as well as nucleosides, 
nucleotoids and nucleotides. Nucleosides are nucleobases 
that contain a pentosef uranosyl moiety, e.g,, an optionally 
substituted riboside or 2 ' -deoxyriboside , and have a linkage 
to other nucleobases that does not contain a phosphorus. 
Nucleotoids are pentosef uranosyl containing nucleobases 
having linkages that contain phosphorus, e.g., 
phosphorothioates , phosphoramidates and methylphosphonates , 
but not phosphates. Nucleotides are pent osef uranosyl 
containing nucleobases that are linked by an unsubst ituted 
phosphodiesters . 

A oligonucleobase chain has a single 5' and 3' 
terminus, which are the ultimate nucleobases of the polymer. 
A particular oligonucleobase chain can contain nucleobases of 
all types. An oligonucleobase CQmpgynd is a compound 
comprising one or more oligonucleobase chains that are 
complementary and hybridized by Watson-Crick base pairing . 

Nucleobases are either deoxyribo- type or ribo-type. 
Ri bo- type nucleobases are pentosef uranosyl containing 
nucleobases wherein the 2 * carbon is a methylene substituted 
with a hydroxyl, alkyloxy or halogen. Deoxvribo- t voe 
nucleobases are nucleobases other than ribo-type nucleobases 
and include all nucleobases that do not contain a 
pentosef uranosyl moiety . 

An ol iQonucleobase strand generically includes both 
oligonucleobase chains and segments or regions of 
oligonucleobase chains. An oligonucleobase strand has a 3' 
end and a 5' end. When a oligonucleobase strand is 
coextensive with a chain, the 3' and 5' ends of the strand 
are also 3' and 5* termini of the chain. 

- 7 - 
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A region is a portion of 
sequence of which is derived f 
e.g., a CMV having a region of 
the sequence of a fragment of 
segment is a portion of a CMV 
structural feature. A given s 
contain both 2 ' - deoxynucleot id 
However, a ribo-type segment o 
contain only ribo-type and 2'- 
respect ively . 



an ol igonucleobase , the 
rom some particular source, 

at least 15 nucleotides having 
the human fi-globin gene. A 
having some characteristic 
egment or a given region can 
es and ribonucleotides, 
r a 2 ' -deoxyribo- tvoe segment 
deoxyribo- type nucleobases 



5 . SUMMARY 

The present invention provides ol igonucleobase 
compounds, termed Chimeric Mutational Vectors (CMV). CMV can 
be used to introduce specific genetic changes in plant and 
animal cells. The invention is applicable to in the field of 
medicine for gene therapy, and in the fields of biomedical 
research, pharmaceutical production and agriculture to 
construct specifically mutated plants and animals. A CMV 
comprises two complementary ol igonucleobase strands. The two 
strands can be present on a single chain or on two chains 
that can be optionally linked by any chemistry for cross- 
1 inking oligonucleotides . 

The sequence of the strands of a CMV is homologous to 
the target gene except for a mutator region which introduces 
the genetic change into the target gene. The CMV can also 
contain regions having a sequence unrelated to the target 
gene. The mutator region must be directly adjacent in both 
3* and 5' directions to a homology region of at least one 
base . 
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The oligonucleobases of the CMV are either ribo-type or 
2 ' -deoxyribo- type . Ribo-type nucleobases contain a 
pentosef uranosyl moiety having a 2' oxygen or halogen. At 
least three contiguous bases of the homologous region of the 
first strand are ribo-type nucleobases that are Watson-Crick 
base paired to deoxyribo- type nucleobases of the second 
strand. Nucleobases that are sensitive to RNase are not 
preferred for- use in the invention. 

6. DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides compounds, termed 
Chimeric Mutational Vectors (CMV) that can be used to make 
specific changes in the genome of a eukaryotic cell. The CMV 
are comprised of polymers of purines and pyrimidines that 
hybridize, i.e., form a Watson-Crick base pairs of purines 
and pyrimidines, to DNA having the appropriate sequence. 
Each CMV is divided into a first and a second strand of at 
least 15 bases each that are complementary to each other and 
can be, but need not be, covalently linked. The polymers of 
purines and pyrimidines, termed oligonucleobases, are 
comprised of two types of subunits, termed nucleobases. 
There are two types of nucleobases. Ribo-type nucleobases 
are ribonucleosides having a 2 ' hydroxyl , substituted hydroxyl 
or 2 ' halo- substituted ribose. All nucleobases other than 
ribo-type nucleobases are deoxyribo- type nucleobases. 

The sequence of the first and second strands consists of 
at least two regions that are homologous to the target gene 
and one or more regions {the "mutator regions") that differ 
from the target gene and introduce the genetic change into 
the target gene. The mutator region is directly adjacent to 
homologous regions in both the 3' and 5' directions. In a 
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preferred embodiment of the invention, each mutator region is 
adjacent in both the 3' and 5' directions to homologous 
regions of at least three bases. In a preferred embodiment 
of the invention, each mutator region is flanked m both the 
3' and 5' directions by ribo-type ol igonucleobase segments of 
at least three bases, which segments need not be adjacent to 
the mutator region. The flanking ribo-type nucleobase 
segments need not be directly adjacent to the mutator region, 
i.e, a portion of the homologous region comprising decxyribo- 
type nucleobases can intervene. The total length of all 
homologous regions is preferably at least 14 bases. If the 
CMV contains two homologous regions separated by a mutator 
region, the homologous regions can more preferably be each 
between 8 and 12 bases long and most preferably be IC bases 
long . 

At least two homologous regions of the first strand are 
comprised of at least three contiguous ribo-type nucleobases 
which are Watson-Crick paired to deoxyribo- type nucleobases 
of the second strand. In a preferred embodiment there are 
between 9 and 25 ribo-type nucleobases and more preferably 20 
ribo-type nucleobases m the first strand, which are Watson- 
Crick paired to deoxyribo- type nucleobases of the second 
strand. In one embodiment there are no ribo-type nucleobases 
m the second strand. In one embodiment the mutator region 
of the first strand consists of deoxyribo- type nucleobases 
and IS flanked by deoxyribo- type nucleobases. Alternatively, 
the mutator region can be comprised of ribo-type nucleobases 
of the first strand and deoxyribo- type nucleobases of the 
second strand. 

The crw IS still further characterized by containing at 
least three nuclease resistant ribo-type nucleobases. In a 

- 10 - 
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preferred embodiment all ribo-type nucleobases are nuclease 
resistant . 

The mutator region can be as large as 2 kilobases and 
can encode an exon . Preferably the mutator region consists 
of 2 0 or fewer bases, more preferably 6 or fewer bases and 
most preferably 3 or fewer bases. The mutator region can be 
of a length different than the length of the sequence that 
separates the regions of the target gene homologous with the 
homologous regions of the CMV so that an insertion or 
deletion of the target gene results. When the CMV is used to 
introduce a deletion there is no base identifiable as within 
the mutator region. Rather, the mutation is effected by the 
juxtaposition of the two homologous regions that are 
separated m the target gene. In one embodiment the mutator 
region is a deletion of from 6 to 1 bases or more preferably 
from 3 to 1 bases. Multiple separated mutations can be 
introduced by a single CMV, in which case there are multiple 
mutator regions in the same CMV. Alternatively multiple CMV 
can be use simultaneously to introduce multiple genetic 
changes in a single gene or, alternatively to introduce 
genetic changes in multiple genes of the same cell. 

In a preferred embodiment the CMV is resistant to RNase . 
Therefore, the use of only naturally occurring ribo-type 
nucleobases, which are sensitive to RNase, is not suitable 
for use m the invention. The ribo-type nucleobases of a CMV 
should include at least three ribo-type nucleobases 
preferably selected from non-phosphodiester linked 
ribonucleotides ( " ribonucleotoids *' ) , 2 ' O- substituted or 
2 • haloribonucleotides , 2 * 0-subst i tuted or 

2 • haloribonucleotoids , and ribonucleosides , which can be 
optionally 2' substituted. In a preferred embodiment of the 

- 11 - 
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C^^V, no nuclease sensitive, i.e., 2 * O-ribonucleocides , are 
used . 

In one embodiment the CMV is a single ol igonucleobase 
chain of between 40 and 100 bases. In an alternative 
embodiment, the CMV comprises a first and a second 
oligonucleobase chain, each of between 20 and ICQ bases; 
wherein the first chain comprises the first strand and the 
second chain comprises the second strand. The first and 
second chains can be linked covalently or, alternatively, can 
be associated only by Watson-Crick base pairings. 

6,1. USES OF CHIMERIC MUTATIONAL VECTORS 

Chimeric Mutational Vectors can be used to introduce 
changes into the sequence any eukaryotic gene, which has a 
known sequence. The change can result in the replacement of 
one or more nucleotides or can be an insertion or deletion of 
one or more nucleotides. In preferred embodiments the 
replacement, insertion or deletion can be of 20 or fewer 
contiguous bases, in a more preferred embodiment the 
replacement, insertion or deletion can be of 6 or fewer bases 
and most preferably of 3 or fewer bases. The insertion can 
be as long as about two kilobases. The insertions or 
deletions can be made in the coding and the regulatory parts 
of the gene . 

Cells can be transfected with the CMV by any technique 
now known or to be developed for transfecting cells with DNA . 
Such techniques include electroporat ion , liposome transfer 
and calcium phosphate precipitation. In one embodiment the 
transfection is performed with a liposomal transfer compound, 
e.g., DOTAP (N- ( 1 - (2,3 -Dioleoyloxy ) propyl] -N , N, N- 
trimethylammonium methylsul f ate , Boehringer-Mannheim) or an 
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equivalent, such as LIPOFECTIN. The amount of CMV is not 
critical to the practice of the invention; good results can 
be achieved with 10 nM/lO- cells. A ratio of about 500 ng of 
CMV m 3 /ig of DOTAP per 10^ cells can be used. The 
transfection technique of Examples 1-3, infra., can be used 
with the modification that the transfected cells are cultured 
in serum free media, media supplemented with human serum 
albumin or human serum. 

The present invention encompasses methods of using 
Chimeric Mutational Vectors. Uses of Chimeric Mutational 
Vectors include the repair of genetic diseases such as 
Gaucher Disease, thalassemia and Sickle Cell disease. Other 
applications include the introduction of stop codons or frame 
shift mutations to make "knock-outs," i.e., transgenic 
animals or plants that lack a functional copy of a specific 
gene, as well as transgenic animals or plants having specific 
mutations. In a still further method of use encompassed by 
the invention specific mutations can be made for the purpose 
of studying the structure function relationships of genes-of- 
mterest. Alternatively, if a desirable mutarion has been 
identified in one species, it can be introduced in the 
homologous genes of other species by use of the CMV. 

For medical purposes, the invention can be used to 
repair mutations or introduce mutations into any cell-type 
that can be removed from a subject's body, cultured and 
reimplanted into the subject. Techniques for the removal, 
culture and reimplantation of hepatocytes, in particular of 
hepatic reserve (stem) cells, have been described in patient 
publication WO94/08598 to Naughton, G.B. and Sibanda, B. , 
April 28, 1994. Examples of genetic diseases that can be 
cured by repair of mutations in hepatocytes include: familial 

- 13 - 
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hypercholesteremia, caused by mutation m the LDL receptors- 
emphysema that caused by a mutation in the al -ant itrypsin 
gene; and hemophilia and Christmas Disease, which are caused 
by mutations m coagulation factors VIII and IX, 
respectively . 

In a yet further use of the invention, CMV can be used 
to mutagenize a population of cells so that a mutant, having 
a selectable phenotype can be obtained. According to this 
aspect of the invention, a mixture of CMV having a mutator 
region of one or several nucleotides is synthesized so that 
the three non-wild type nucleotides are present at each 
position of the mutator region. The treatment of a 
population of cells with such a mixture of CMV will induce a 
variety of mutations in the target gene. After an 
appropriate selection step, a mutant having the desired 
phenotype can be recovered. 

6.2, STRUCTURE OF EXEMPLARY CHIMERIC MUTATIONAL 

VECTORS 

6.2.1. SINGLE CHAIN CHIMERIC MUTATIONAL VECTORS 



In one embodiment a Chimeric Mutational Vector (CMV) is 
a single 5 ',3 '-linked ol igonucleobase chain comprised of 
between about 40 and about 100 pentosef uranosyl containing 
nucleobases. A single chain CMV can contain unpaired 
nucleotides, which form one or two hair-pin turns, which turn 
or turns divide (s) the CMV into the first and second strands, 
so that at least 15 bases of the first strand can be Watson- 
Crick paired to bases of the second strand. 

Figure 1 shows the structure of one embodiment of a 
smgle-chaim CMV with segments (a) - (h) . In the embodiment of 
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Figure 1, the first strand consists of segments (c) , (d) and 
(e) , which is complementary to a second strand consisting of 
segment (a) . In this particular embodiment the 3 ' terminus 
of the CMV is illustrated to be at the 3' end of the (a) 
segment and the 5* terminus is shown to be at the 5' end of 
the (h) segment. However, the location of the termini and 
the orientation of 3' and 5' directions of the CMV with 
respect to the segments can be elsewhere, so long as the 
termini do not interrupt the homologous or mutator regions of 
the first or second strands. The segments are labeled 
sequentially (a) through (h) . 

In one embodiment the lengths and characteristics of the 
segments are as follows. Segment (a) is between 16 and 40 
nucleotides and preferably between 20 and 30 nucleotides. 
The sequence of the region of segment (a) can be either that 
of the coding strand or the non- coding strand of the gene 
that contains the intended mutation (the "mutated target 
gene"). The location of the sequence of segment (a) must 
include the portion of the target gene that is to be changed. 
Unless the target gene is not normally transcribed in target 
cell, it is preferred that the sequence of segment (a) is the 
sequence of the coding strand of the target gene. When the 
target gene is not transcribed in the target cell, then 
neither the coding strand sequence or the non- coding strand 
sequence is preferred. The sequence of segment (a) 
determines the sequences and combined lengths of segments 
(c)-{e), which must be complementary to segment (a). 

The oligonucleobases of the portion of segment (a) that 
are base paired with segments (c) and (e) can be any 2'- 
deoxyribo- type nucleobases The nucleobase of segments (c) 
and (e) , which are termed ribonucleotide segments, can be any 
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ribo-cype nucleobase 2* O-ribonucleot ides , i.e., nucleotides 
that is known or will be developed. In a preferred 
embodiment, the nucleotides of segment (d) , which is termed 
the intervening segment are 2 ' - deoxyribo- type nucleobases . 
Alternatively, segment (d) can be made of ribo-type 
nucleobases; in which case the boundaries between segments 
(c) , id) and (e) are not defined. Segments (b) and (f) 
through (h) can be of any type of nucleobase. 

In a preferred enabodiment , the sequence of segments (c) 
and (e) are completely homologous to the target gene. 
However, a one base mutator region in the (c) or (e) segment 
can result m the mutation of the target gene at the 
homologous position . 

Segments (b) and (g) are about 4 nucleotides in length 
and form single stranded hairpin turns that allow segments 
(a) and (c)-(e) and segments (f) and (h) to form Watson-Crick 
base pairs, i.e., to form duplex nucleic acids. In 
alternative embodiments the function of segments (b) and (c) , 
which is to covalently link the first and second strands, can 
be served by non-ol igonucleobase moieties. 

Segments (c) and (e) , also termed the first and second 
ribo-type segments, consist, m one embodiment, of 2 ' -O- 
methylr ibonucleot ides . In a preferred embodiment, segments 
(c) and (e) are independently, between 6 and 13 nucleotides. 

Segment (d) , also termed the intervening segment, in one 
embodiment, is between 4 and 20 nucleotides in length. If 
the target gene contains two or more point mutations that are 
separated by fewer than 15 nucleotides, each can be repaired 
by the same CMV. 

Segments (f) and (h) form a duplex that brings the 3' 
and 5' ends of the CMV, which is nicked between segments (a) 
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and (h) into juxtaposition. The structure formed by segments 
(f ) , (g) and (h) is termed a hairpin cap. A hairpin cap 
contains a terminal end and a nonterminal end. The terminal 
end forms a terminus of the chain, which can be either a 5' 
or a 3 ' terminus. The function of a hairpin cap is to 
control the position of the 3' or 5 ' terminus. The non- 
terminal end of the hairpin cap can be ligated to an end of a 
strand, whereby the end of the complementary scrand, which is 
the second terminus of the chain is juxtaposed to the 
terminal end of the hairpin cap, as shown in figure 1. The 
3' and 5' termini can be, in one embodiment, 

dephosphorylated . In an alternative embodiment, the 3' and 
5 ' termini can be covalently linked by a phosphodiester bond 
or equivalent, so that the CMV is a closed circular 
oligonucleotide. Segments (f) and (h) can be optionally 
deleted from a closed circular CMV. In a preferred 
embodiment the orientation of the ol igonucleobase of the 
hairpin cap is the same as the orientation of the strand to 
which It is linked. If the orientation of the hairpin cap is 
ant 1 -parallel to the orientation of the strand to which it is 
attached, tr.e designation as 3 ' or 5 ' of the terminal end of 
the hairpin is determined by the structure of terminal end of 
the complementary . 

In a preferred embodiment, the CMV is a single chain CMV 
containing one hairpin cap, oriented in parallel with the 
chain, having juxtaposed 3' and 5' ends. There are eight 
specific embodiments of this type, which are defined by the 
location of the ligation between the hairpin cap and the 
chain and by whether the sequence of the first strand is the 
sequence of the coding strand or the non-coding strand of the 
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target gene. The eight species are given in Table I. Figure 
1 illustrates species 2 and 6 of Table I . 



TABLE I 



Species 


T .or'r? r "i on o'f 1 ■icration of 
Hairpin Cao and Chain 


Sequence of First Strand 


1 


3 • First Strand 


Ceding 


2 


5' First Strand 


Coding 


3 


3 ' Second Strand 


Coding 


4 


5 ' Second Strand 


Coding 


5 


3 * First Strand 


Non- coding 


6 


5' First Strand 


Non-coding 


7 


3 ' Second Strand 


Non-coding 


8 


5 ' Second Strand 


Non- coding 



6.2.2. DUAL, CHAIN CHIMERIC MUTATIONAL VECTORS 



Alternatively, the CMV can comprise two chains, each 
chain having a 3* and a 5' terminus, wherein a first chain 
comprises the first strand and a second chain comprises a 
second strand. The first and second chains can be cross- 
linked by a covalent linker or the first and second chains 
can be held in association only by Watson-Crick bases 
pairing. The lengths of the regions and segments cf the 
first and second strands of a two chain CMV are constructed 
according to the foregoing guides regarding single chain CMV. 
In one embodiment, the first and second chains can further 
comprise complementary segments of between 3 and 10 bases 
that flank the first and second strands that increase the 
stability of the association between the first and second 
chains . 
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An alternative embodiment of the two chain CMV can 
comprise two ol igonucleobase chains and two hairpin caps; the 
first strand is a part of the first chain and the second 
strand is a part of the second chain. The hairpin caps can 
both be ligated to the ends of one strand. In one specific 
configuration, termed a "cradle" configuration, a hairpin cap 
is ligated to each end of the second strand. In an "anti- 
cradle" configuration, the hairpin caps are ligated to the 
ends of the first strand. An alternative type of 
configuration, termed " head- to- tail , " consists of a hairpin 
cap ligated to each of the strands. Because the strands of 
the CMV hybridize only m an antiparallel fashion, there are 
only two specific configurations of the head-to-tail type: 
the hairpin caps can both be ligated to either the 3' ends or 
the 5' ends of the strands. 

6.3. THE SYNTHESIS OF CMV AND SELECTION OF NUCLEOBASES 
CMV can be synthesized by any technique used to 
synthesize oligonucleotides or oligonucleotide analogs. For 
CMV having chains up to about 100 bases in length, the 
preferred technique is solid-phase synthesis. Alternatively, 
subsegments of CMV chains greater than about 50 bases in 
length can be synthesized by solid phase synthesis and 
ligated by liquid phases techniques, well known to those 
skilled in the art. Wosnick, M.A., 1987, Gene 60:115-117. 
As those skilled in the art appreciate, complementary 
subsegments can be synthesized by solid-phase techniques so 
that when annealed, the ends of the subsegments are 
staggered. By causing adjacent subsegments to have 
complementary staggered ends, adjacent segments can be 
ligated by well known enzymatic processes. By this technique 
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a chain of a CMV considerably larger than 100 bases can be 
synthesized . 

The nucleobases of a chain of a CMV can be any 
nucleobase now known or to be developed that hybridizes by 
Watson-Crick base pairing to DNA. Suitable nucleobases 
include nucleotides and nucleotoids . The structure and 
synthesis of ol igonucleobases having exemplary nucleotoids 
can be found as follows: Phosphorct hioates , Eckstein, F,, 
Ann. Rev. Biochem., 1985, 54, 367; Phosphoramidates , 
Froehler, B.C., et al . , Nucleic Acid Research; 1988, 16, 
4831; Methylphosphonates , Miller, P.S., et al . , 1985, 
Biochimie, 1985, 67, 769. A method of producing 
ol igonucleotoids having chiral -specific phosphorus-containing 
linkages is described m U.S. Pat. No. 5,212,295. Chirally- 
specific oligonucleotides having the appropriately selected 
isomer hybridize to DNA with improved stability. 

Pentosef uranosyl containing nucleobases that are linked 
by non- phosphorus nucleobases that can be used as deoxyribo- 
type nucleobases, which are termed nucleosides. Nucleosides 
that form duplexes with DNA that are at least as stable as 
DNA/DNA duplexes are linked by the various linkage 
chemistries. The chemistries and methods for their use m 
ol igonucleobases are described as follows: 

methylhydroxylamme linkages, Vasseur et al . , J. Am. Chem. 
Soc. 1992, 114, 4006, U.S. Pat, Nos . 5,386,023 and 5,489,677; 
alkylene dioxy linkages, U.S. Pat. No. 5,223,618; and 
3 ' -Thiof ormacetal , Jones et al . , J. Org. Chem. 1993, 58, 
2983 . 

Other nucleosides that can be used m CKV include: 
Carbamates, Stirchak et el., J. Org, Chem. 1987, 52, 4202; 
Sulfonate U Sulfonamide, Glemarec et al . , Tetrahedron 1993, 
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49, 2287, Reynolds et al . , J. Org. Chem. 1992, 57, 2983; 
Sulfone, Huang, Z., J. Org. Chem. 1991, 56, 3869; Sulfamate, 
Huie, E.M., et al . , J. Org. Chem., 1992, 57, 4569; and 
Diisopropylsilyl & silyl, Cormier and Ogilvie, Nucleic Acids 
Res. 1988, 16, 4583, Ogilvie & Cormier, Tetrahedron Lett. 
1985, 26, 4159. 

Pentosef uranosyl containing nucleobases can be of either 
the ribo-type or 2 ' -deoxyribo- type . At least three ribo-type 
nucleobases for use in CMV must be nuclease resistant. 
Suitable nuclease resistent ribo-type nucleobases can be 
selected from the group of ribonuclease resistant 
nucleobases, consisting of 2 ' AX-nucleosides , 2'AX- 
nucleotoids, 2 ' AR-nucleot ides, where A= O, CI or Br and 

when A=0, then X=H or C^ ^ ^nd R=C:.^,, or when A is a halogen, 
then X and R are omitted. 

Nucleobases that lack a pentosef uranosyl moiety can be 
used as deoxyribo- type nucleobases. Suitable examples 
include replacement of the pentosef uranosyl phosphate moiety 
by a Morpholino carbamate, Wang & Weller, Tetrahedron Lett., 
1991, 32, 7385 and peptide nucleic acids m which the 
pentosef uranosyl phosphate moiety is replaced by an 
aminoethylglycine . Peptide nucleic acids (PNA) are described 
m Egholm et al . , J. Am. Chem. Soc . , 1992, 114, 1895 and 
Huang, B.S. et al . , J. Org. Chem., 1991, 56, 5006 and WO 
92/20703 to Buchardt et al . ; methods of making 
PBA/oligonucleot ide chimeric polymers is described in WO 
95/14706 . 

Those skilled in the art understand that PNA can 
hybridize to DNA in either orientation, i.e., either end of a 
PNA can be the 3 ' or 5 ' end. Peffer, N.J,, et al . , 1993, 
Proc . Natl. Acad. Sci . 90:10648-52. When a peptide 
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nucleobase is present in an ol igonucleobase strand having 
pentosefuranosyl -containing nucleobases, the 3' and 5' ends 
of the strand are determined by the orientation of the 
pentosefuranosyl moieties or, if none are present in the 
chain having the peptide nucleobase, then the 3' and 5' ends 
of the strand are determined by the orientation of 
pentosefuranosyl nucleobases of the complementary strand. 
Note, that the first strand of a CMV must contain at least 
three pentosefuranosyl nucleobases . 

7 . EXAMPLES 

Example 7,1. THE USE OF CMV TO REPAIR EPISOMAL ALKALINE 

PHOSPHATASE 

An expression plasmid containing the wild- type human 
1 iver/bone/kidney alkaline phosphatase cDNA under the control 
of the SV4 0 early promoter was obtained and designated pHAP . 
An identical plasmid with the mutant version of the cDNA was 
obtained and designated p711. The design of CMV to 
interconvert the sequences of pHAP and p711 are diagrammed in 
Figure 2A. The CMVChl was designed to repair the missense 
mutation at position 711. It has a G residue, the wild-type 
sequence, at the site corresponding to the mutation, Ch2 has 
a design identical to Chi except for an A instead of G at the 
site corresponding to position 711, Ch3 has the same 
sequence as Chi but the sequence of the ribonucleotide 
segments is that of the coding strand of the alkaline 
phosphate gene instead of the non-coding strand. The 
oligonucleotide Dhl contained the same sequence as Chi, but 
contained only 2 ' -deoxynucleot ides . 

The schematic of p711 in Figure 2B shows the single 
point mutation, A, at position 711 m the coding region of 
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the alkaline phosphatase cDNA, the SV40 early promoter (Pe) , 
SV40 origin of replication (ori) , polyadenylat ion addition 
site and small-t intron sequence for splicing (SV40 poly A) . 
The dotted box in Figure 2B indicates the sequence from 
pBR322 encoding the origin of replication and (3-lactamase 
(Amp**) gene. CHO cells were transfected with p711 and 6 h 
later the CMV, Chi, was introduced to CHO cells previously 
transfected with p7ll. Both transf ect ions were performed 
with lipofectin. The extent of the conversion to the wild- 
type phenotype was monitored at both biochemical and DNA 
sequence levels, by spect rophotometric measurement, 
n iscochemical staining and analysis of Hirt DNA. 

MATERIALS AND METHODS 

Sy^nthesis and purification of oligonucleotides: The 
chimeric oligonucleotides were synthesized on a 0.2 /xmole 
scale using the 1000 A wide pore CPG on the ABI 3 94 DNA/RNA 
synthesizer. The exocyclic amine groups of DNA 
phosphoramidites (Applied Biosystems, Foster City, CA) are 
protected with benzoyl for adenine and cytidine and 
isobutyryl for guanine. The 2 ' -O-methyl RNA phosphoramidites 
;Glen Research, Sterling, VA) are protected with 
Dhenoxyacetyl group for adenine, dimethyl formamidine for 
guanine and isobutyryl for cytidine. After the synthesis was 
complete, the base-protecting groups were removed by heating 
m ethanol : concentrated ammonium hydroxide (1:3) for 20 h at 
55°C. The crude oligonucleotide sample was mixed with 7M 
urea and 10% glycerol, heated to 70°C and loaded on a 10% 
polyacrylamide gel containing 7M urea. After gel 
electrophoresis, DNA bands were visualized by UV shadowing, 
dissected from the gel, crushed and eluted overnight in TE 
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buffer (10 mM Tris-HCl and 1 mM EDTA, pK 7.5) with shaking. 
The eluent containing gel pieces was spun through .45 fim spin 
filter (Millipore, Bedford, MA) and precipitated with 
ethanol. Samples were further desalted by G-25 spin column 
(Boehringer Mannheim, Indianapolis, IN) and greater than 95% 
of the purified oligonucleotides were found to be full 
length . 

Tjransient trans fection and hlstochemical staining : CHO 
cells were maintained m DMEM (B.R.L-, Bethesda, MD) 
containing 10% FBS (B.R.L. Bethesda, MD) . Transient 
transfection was carried out by addition of 10 ^g of 
lipofectm m 1ml of OPTIMEM and added to each well. He 
alkaline phosphatase activity was measured 24h after 
transfection of the oligonucleotide. For histochemical 
staining, the cells were washed with 0.15 M NaCl three times, 
incubated with staining solution for 20 min and fixed with 
50% ethanol. The staining solution consisted of 2mg Fast 
Violet, 2ml Naphtol AS-MX phosphate alkaline solution (Sigma 
Chemical Company, St. Louis, MO) in 50 ml of water. 

SpGCtropbotometric measurement of alkaline phosphate 
actiivity: Transient transfection was carried out in 
triplicates by addition of 1 pg of the plasmid p711 with Ifig 
of lipofectm m 100 al of OPTIMEM (B.R.L. Bethesda, MD) to 
1x10"* CHO cells in a 96-well place. After 6h, various 
amounts of Chi or other CMV were mixed with 1 ^9 of 
lipofectm in 100 ^1 of OPTIMEM and added to each well. 
After 18h, the medium was aspirated and 200 ijlI of DMEM 
containing 10% FBS was adcjed to each well. The alkaline 
phosphate activity was measured 24h after transfection of 
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chimeric oligonucleotides . Spectrophotomet ric measurement 
was carried out by the Elisa Amplification System (B.R.L, 
Bethesda, MD) . Cells were washed with 0.15 M NaCl three times 
and lysed m 100 fil of NP40 buffer containing 10 mM NaCl, 
0.5% NP40, 3mM MgCl2 and 10 mM Tris-HCl pH 7 . 5 . A fraction 
of cell lysates (20 /il) was incubated with 50 fil of Elisa 
substrate and 50 fil of Elisa amplifier (B.R.L. Bethesda, MD) , 
the reaction was stopped by addition of 50 1 of 0 . 3 M H^SO^ 
after 5 min of incubation with amplifier. The extent of 
reaction was carried out within the linear range of the 
detection method. The absorbance was read by an Elisa Plate 
Reader (B.R.L. Bethesda, MD) at a wavelength of 4 90 nm . 



Hirt DNA isolation, colony hybridization and direct DNA 
sequencing- of PGR fragment : The cells were harvested for 
vector DNA isolation by a modified alkaline lysis procedure 
24 h after transfection with the chimeric oligonucleotide. 
The cells were detached by trypsinization , washed, and 
resuspended m 100 ^^l of a solution containing 50mM Tris-HCl 
pH 8.0, lOmM EDTA and 110 ^1 of ^ solution containing 50 mM 
Tri-HCl pH 8-0, 10 mM EDTA and 100 ^g/ml of RNase A. An 
equal volume of cell lysis solution (0.2N NaOH and 1% SDS) 
was added, followed by 100 ^1 of neutralization solution (3M 
KAc, pH 5.5) . A 10 -min room temperature incubation was 
followed by centrif ugation of 10,000 rpm for 10 min. The 
supernatant was extracted with an equal volume of phenol - 
chloroform and precipitated with ethanol . Hirt DNA was 
transformed into E. coli DH5a cells (B.R.L. Bethesda, MD) . 
Colonies from Hirt DNA were screened for specific 
hybrid! zat ion for each probe designed to di st inguish the 
point mutation. Colonies were grown on ampicillin plates, 
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lifted onto nitrocellulose filter paper m duplicate, and 
processed for colony hybridization. The blots were 
hybridized to ^'P-end- labelled oligonucleotide probes, 711-A 
{5 ' -CCGCCTAC&CCCACTCG-3 • (SEQ ID NO:l)) or 711-G (5'- 
CCGCCTACGCCCACTCG-3 * (SEQ ID NO : 2 ) ) at 37°C in solution 
containing Bx Denhardts, 1% SDS , 2x BSC and 100 /ig/ml 
denatured salmon sperm DNA . Blots were washed a* 52 in 
TMAC solution ( 3 . CM t et ramet hylammonium chloride/50 mM Trxs- 
HCl , pH 8.0, 2mM EDTA and 0.1% SDS > . Plasmid DNA was made 
from twenty colonies shown to hybridize to either 711 -G or 
711 -A, using the Qiagen mmiprep kit (Chatworth, CA) . 
several hundred bases flanking the 711 position of each 
pdasmid were sequenced in both direction by automatic 
sequencing (ABI 373A, Applied Biosystem, Foster City, CA) . A 
190 bp ?CR-amplif ied fragment was generated by Vent^ 
polymerase (New England Biolabs, Beverly, MA) utilizing two 
primers ( 5 ' -CAATGTCCCTGATGTTATGCA- 3 ' (SEQ ID NO : 3 ) and 5'- 
CGCTGGGCCAAGGACGCT-3 ' (SEQ ID NO : 4 ) ) ^ corresponding to 
position 630-650 and 803-822 of the alkaline phosphatase cDNA 
flanking the 711 position. The fragment was gel-purified and 
sub:)ected to automatic DNA sequencing (ABI 373A, Applied 
Biosystem, Foster City, CA) . 

Oligonucleotide stahlli ty measurement: Ten ng cf the ^^P-end- 
labelled oligonucleotide was mixed with 500 ng cf the 
unlabelled oligonucleotide and transfected as described 
above. In order to reduce a nonspecific binding of 
oligonucleotides, cells were washed extensively with PBS and 
a solution containing 1 M NaCl/HAc pH 2 . 5 . A crude lysate 
was prepared by lysmg the cells in a solution containing 
lOmM Tris-KCl pH 7.5, 0.5 mM MqCl. and 0.5% Triton x-100 
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followed by phenol -chloroform extraction. Lysates were 
analyzed by 15% polyacrylamide gel containing 7M urea 
followed by autoradiography. Oligonucleotides incubated m 
DMEM containing 10% FBS were processed and analyzed m the 
same manner . 

In our experimental design, various chimeric 
oligonucleotides were introduced into CHO cells previously 
transfected with p7ll. The extent of the conversion to the 
wild-type phenotype was monitored by histochemical staining; 
red pigment was deposited on the cells expressing an active 
enzyme. When cells with he mutant gene were transfected with 
Chi, red cells appeared at a frequency, of approximately one 
in three transfected CHO cells, on the average, at 11 nM. In 
contrast, neither Ch2 nor Dhl caused an increased enzymatic 
activity. Conversion to wild- type was observed at a low 
level when cells were transfected with Ch3 . The transfection 
frequency measured by the expression of the wild- type plasmid 
pHAP was estimated to be 30%. 

The enzymatic activity was also measured by 
spectrophotometric method described above. A dose -dependent 
increase of alkaline phosphatase activity was observed up to 
17 nM of Chi in the presence of p711 plasmid. The enzymatic 
activity of cells treated with Chi at 17nM approached 60% of 
that observed from cells transfected with the wild-type 
plasmid, pHAP . The increase was sequence-specific since the 
same amount of Chi did not affect enzymatic activity of cells 
transfected with pHAP . Furthermore, Ch2 which contained a 
single base pair change from the Chi sequence did not cause 
any increase in enzymatic activity. The oligonucleotide, 
Dhl, which contained the same sequence as Chi, but did not 
contain a ribonucleotide segment, did not exhibit an 
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increase. Thus, spectrophocometric measurements of alkaline 
phosphatase activity were consistent with the result from the 
histochemical staining . 

Correction of a point mutation of the targeted DNA sequence 
by the chimeric oligonucleotide: In order to confirm the 
change at the DNA sequence level, a Hirt extract was made 
from the cells transfected with the p711 and various 
oligonucleotides by a modified alkaline lysis procedure, 
Wang. G, et al . , 1995, Mol . Cell, Biol. 15, 1759, 24 h after 
transfection of the chimeric oligonucleotide. Hirt DNA 
transformed DH5a cells efficiently, resulting in 10' Amp^ 
colonies from 10^ transfected CHO cells. DH5a transf ormant s 
were screened for specific hybridization with a probe 
designed to distinguish between the point mutation (A) and 
the wild-type (G) sequence, corresponding to position 703-719 
of mutant and normal cDNAs , respectively, Weiss, M J . , 2 988, 
Proc. Natl. Acad. Sci . 85:7666. The frequency of correction 
was measured by averaging the number of colonies hybridized 
to the 711-G or 711-A probe using more than 500 colonies 
hybridized to the 711-G or 711-A probe using more than 500 
colonies of the multiple plates generated from at least two 
separate transfection experiments (Table 1). Similar 
frequencies of conversion were observed for two batches of 
Chi prepared by separate synthesis. Approximately 70% of the 
colonies generated from the Hirt DNA made from cells 
transfected with p7ll and Chi hybridized to the 711-A probe, 
while 30% of colonies exhibited hybridization to the 711-G 
probe (Table I) . Thus, a correction frequency of 30% was 
observed at llnM of Chi, reproducibly . Hybridization was 
specific and no cross - hybridi zat ion was observed between the 
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two populations. DNA sequencing was carrxed out with plasmid 
DMAs prepared from twenty of these colonies in both 
directions utilizing two primers ( 5 ' -CAATGTCCCTGATGTTATGCA- 3 ' 

(SEQ ID NO:5) and 5 ' -CGCTGGGCCAAGGACGCT- 3 ' (SEQ ID N0:6)), 
corresponding to position 630-650 and 803-822 of the alkaline 
phosphatase cDNA flanking the 711 position. The sequence 
conversion was confirmed in each case and no other alteration 
in sequence was observed within several hundred of bases 
surrounding the target nucleotide. All colonies froni the 
Hirt extract prepared from Ch2 or Dhl- treated cells 
hybridized tc the 711 -A probe only (Table I) . Some colonies 
from the Hirt extract of the Ch3 hybridized to the wild-type 
probe, but to a much lesser extent than that of the Chi 

(Table II). These results confirmed that the differential 
alkaline phosphatase activities exhibited were due to the 
correction of the point mutation (A to G) at the DNA sequence 
level . 
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Table 11. Hybridization pattern of transf ormant s 
from Hirt extract prepared from duplicate 
transf ections of the p711 plasmid and various 
oligonucleotides at 11 nM. 
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RecA-def icient E. coli strains used to propagate plasmid 
DNA are capable of repair and homologous pairing functions 
using episomal DNA. In order to rule out the possibility 
that the sequence conversion is mediated by E. coli, direct 
DNA sequencing of a PGR- ampl i f ied fragment of Hirt DNA was 
carried out. Two primers flanking the 711 position were 
utilized to generate a 190 bp fragment through the action of 
VentR polymerase. The results indicated that position 711 
was a mixture of A (70%) and G(30%) when the Hirt DNA sample 
was made from the cells transfected cells with the 
combination of p711 and Chi. In contrast, no mixed sequence 
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was observed at position 711 when Hirt DNA was made from 
oligonucleotide Dhl . These results established clearly that 
sequence correction by the chimeric oligonucleotide occurred 
in mammalian cells. 

Stability of chimeric oligonucleotide: The stability of the 
chimeric oligonucleotide was measured intracellularly and in 
growth medium containing 10% FBS . Ten nanogram of 
radiolabelled oligonucleotide, Chi, was added to the same 
transfection experiment in which histochemical staining and 
Hirt DNA analyses was conducted (see Materials and Methods) . 
The chimeric oligonucleotides are extremely stable. No 
detectable degradation was observed when chimeric 
oligonucleotide was incubated in growth medium containing 10% 
FBS, after 24 h incubation. Moreover, oligonucleotide 
isolated from cells did not exhibit any degradation during 
the same incubation time. Only monomers of the chimeric 
oligonucleotide were detected when isolate from cells 24 h 
after incubation. Thus, under the experimental conditions 
employed here, no end-to-end litigation of chimeric 
oligonucleotides was observed. 

Example 7.2, THE USE OF CMV TO MUTATE THE S-GLOBIN GENE 

IN EBV- TRANS FORMED CELL LINES 

A CMV designed to repair the mutation found m Sickle 

Cell Disease 11,-globin was designed SCI, Figure 3. The 

molecule was composed of DNA residues with two intervening 

blocks of ten 2 ' -O-methyl RNA residues flanking a short 

stretch of five DNA residues. When the molecule was folded 

into the duplex conformation, one strand contained only DNA 

residues while the other strand contained the RNA/DNA blocks. 
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In this case, the internal sequence is complementary to the 
1^^ globin sequence over a stretch of 25 residues that span 
the site of the mutation, with the exception of a single 
base (T) which is m bold and designated with an asterisk. 
The five DNA residues flanked by RNA residues were centered 
about the mutant T residue m the coding sequence. A 
control chimeric oligonucleotide (SC2) was designed in the 
same manner with the exception of the base (A) designated in 
bold and with an asterisk. Genomic sequences of the , 
and closely-related 6-globin genes are also displayed in 
Figure 3A with the specific site of E>^' mutation printed in 
bold . 

Lymphoblastoid cells were prepared as follows. Heparin- 
treated blood was obtained from discarded clinical material 
of a patient with sickle cell disease and from one of the 
investigators who had neither history nor symptoms of the 
disease. Mononuclear cells were prepared from blood (=8 ml) 
by density gradient centrif ugation m Ficoll and infected 
with Epstein-Barr virus which had been propagated in the 
marmoset cell line E95-8 (Coriell Institute for Medical 
Research #GM07404D) . Infections were performed with addition 
of C-1 mg leucoagglut mm PHA-L in 10 ml RPMI medium 
supplemented with 20% fetal bovine serum in a T25 flask. 
Cultures were fed twice a week starting on day 5 and were 
considered established once 60-70% of the cells remained 
viable at day 21. The fi'' and lymphoblastoid cells were 
maintained in RPMI medium containing 10% fetal bovine serum. 

The CMV was introduced into the above -described 
lymphoblastoid cells homozygous for the 13.^ allele as follows. 
Cells (1 X 10^ per m.l ) were seeded m 1ml of medium in each 
well of a 24-well tissue culture plate the day prior to the 
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experiment. Transf actions were performed by mixing chimeric 
oligonucleotides with 3 mg of DOTAP (N-[l-(2,3- 
Dioleoyloxy) propyl ) -N, N, N-trimethylammonium methylsulf ate , 
Boehringer-Mannheim) in 20 ml of 20 mM HEPES, pH 7.3, 
incubated at room temperature for 15 min, and added to the 
cultured cells. After 6 h the cells were harvested by 
centrif ugation, washed and prepared for PGR amplification 
following the procedure of E.S. Kawasaki, PGR Protocols, Eds. 
M.A. Innis, D.H. Gelfand, J.J. Sninsky and T.J. White, ppl46- 
152, Academic Press, (1990). 

Gorrection of the single base mutation was assessed by 
taking advantage of well known restriction fragment length 
polymorphisms resulting from the E.^ mutation, R.F, Greeves et 
al . , 1981, Proc. Natl. Acad. Sci . 78:5081; J.G. Chang and 
Y.W. Kan, 1982, N. Eng. J. Med. 307:30; S.H. Orkin et al . , 
ibid., p. 32; J.T. Wilson et al . , 1982, Proc, Natl. Acad. 
Sci- 79:3628. The A to T transversion in the allele 
results in the loss of a Bsu36I restriction site (CGTGAGG) . 
Thus, the 1^^ allele can be detected by Southern hybridization 
analysis of genomic DNA cut with Bsu36I. A 1.2 kbp Bsu36I 
DNA fragment of the IS-globin gene present normally is absent 
in the 1^^ allele and is replaced by a diagnostic 1.4 kbp 
fragment. When genomic DNA recovered from homozygous 
lymphoblastoid cells was analyzed by this procedure, the 
expected 1.4 kbp fragment was observed. However, two 
fragments were observed in DNA from cells transfected with 
the SGI GMV. The presence of the 1.2 kbp fragment in 
addition to the 1.4 kbp fragment indicates partial correction 
of the allele had taken place in a dose-dependent fashion. 

To measure the efficiency of correction rapidly and 
sensitively, we adapted a PCR-based RFLP analysis. For the 
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analysis of the S-globin sequence, the 345 bp PGR fragment 
was prepared by amplification from a crude cell lysate using 
primers BG02 ( 5 ' -TCCTAAGCCAGTGCCAGAAGA- 3 • (SEQ ID NO : 7 ) ) and 
BG0 5 ( 5 ' -CTATTGGTCTCCTTAAACCTG-3 ' (SEQ ID NO : 8 ) ) and Expand 
Taq polymerase (Boehringer Mannheim) . For the analysis of 
the 6-globin gene, the same cell extracts were used in 
amplification reactions with primers E)G06 (5'- 
CTCACAAACTAATGAAACCCTGC-3 ' (SEQ ID NO : 9 ) ) and DG07 (5'- 
GAAAACAGCCCAAGGGACAG-3 ' (SEQ ID NO: 10)) to generate a 335 bp 
fragment. Gels were stained with SYBR"^*^ green (FMC 
Bioproducts) and fluorescence intensities were quantitaced 
using a Molecular Dynamics f luoroimager . DNA sequencing was 
performed m both directions using an ABI 373A sequencer. 

The above primers were designed to yield a 345 bp 
fragment spanning the site of the iS^ mutation after PGR 
amplification of genomic DNA. The fragment from normal cells 
contained a Bsu36I recognition sequence and yielded fragments 
of 228 bp and 117 bp, while DNA from gene contained the 
sequence GCTGTGG and remained refractory to cutting. 
Analysis indicated that the 345 bp DNA fragment amplified 
from SCI -treated cells was partially cleaved with Bsu36I, 
indicating correction of the mutation on some, but not all, 
chromosomes. A quantitative measure was obtained by 
comparing the relative intensities of the three DNA fragments 
after elect rophoret ic separation and staining with the 
fluorescent dye SYBR"''^ green. The stained bands were imaged 
using a laser fluoroimager and the relative levels were 
calculated. Conversion efficiency was quantitated by 
scanning the cyber green- stained agarose gel with a 
fluoroimager. Experiments at doses of between 2.5 and 25.0 
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pM of SCI / 10^ lymphoblastoid cells showed between about 
40% and 55% conversion of fi^ to (Figure 4A) . 

The frequency of introduction of the sickle mutation by 
the CMV SC2 was also determined by the methods set forth 
above. Analysis indicated that the level of correction 
exceeded 50% at the highest level of input chimeric molecule, 
25 nM, but even at 2.5 nM correction of 30% of S>-globin genes 
was observed (Figure 4B) . 

Direct sequencing of the PGR amplified 345 bp fragment 
was performed to confirm the T to A change in the coding 
strand. In the DNA sample from i^^ cells transfected with 
chimeric molecule SCI at a higher concentration greater than 
12 nM/10^ cells. Sequence analysis revealed an approximately 
equal mixture of A and T residues at the site of the 1^^ 
mutation. DNA from untreated cells contained only T at 
that position and DNA from S'^ cells contained only A when 
ti-eated with SCI. Treatment of i^-^ cells transfected with the 
control CMV SC2 caused no change in the S-globin gene 
sequence. However, the DNA from normal cells transfected 
with SC2 was partially converted to the mutant sequence as 
evidenced by a mixture of T and A residues at the expected 
position of the sequence. 

The specificity of the action of CMV was assessed by 
sequencing the related 6-globin gene, which is more than 90% 
homologous to the ll.-globin gene. The S and 6 globin genes 
are identical over the 5 bp DNA core targeting region of SCI. 
Two single base differences are underlined in Figure 3. To 
determine whether SC2 altered the 5-globin gene, DNA sequence 
analysis was performed as above. The results showed that no 
alteration was introduced into the 6-globin gene " by the SC2 
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CNTV in contrast: to the observed change directed by SC2 in the 
S^-globm sequence . 

Example 7.3. THE EXPERIMENTAL USE OF CMV TO 

MUTATE THE E-GLOBIN GENE OF A 
HSC 

METHODS AND MATERIALS 

Stem cell isolatian and transf ec t i on : Normal volunteers 
were given G-CSF 300 /ig S.C. twice a day for five days. On 
the fourth and fifth days of G-CSF therapy they underwent a 
four hour stem cell apheresis using a C03E spectra pheresis 
machine. Mononuclear cells were prepared by density gradient 
centrif ugat ion on Ficol 1 - Hypaque (density 1.077 g/ml, 
Pharmacia) {2000 rpm, 10 min, room temperature) . The 
majority of the monocytes were removed after adherence to 
plastics (30 mun, 37^ C m 5% CO;, m RPMI with 10% FBS ) . 
Cells were harvested by swirling to remove cells loosely 
adherent to the plastics which were washed 3 times with PBS. 
This population was incubated with biotinylated murine anti- 
CD34 antibodies m PBS/1% BSA for 25 mm at room temperature 
at a concentration of 100 x 10" cells/ml . The antibody- 
treated cells were passed over an avidin column and those 
passing through the column were then collected for analysis. 
Subsequently the column was washed with PBS and CD34' cells 
adhering to the column were recovered by squeezing the 
column. Final purities were assessed by FACS . 

Cells were re-suspended in RPMI with 10% FCS heat 
inactivated and 1 x 10^ cells/ml were plated in a 24 well 
pate with each well receiving 1 x 10^ cells. The indicated 
amounts of chimeric oligonucleotide were mixed with 3 /ig 
DOTAP m 20 of 20 mM HEPES , pH 7.3. The m.ixture was 
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incubated on ice for 15 minutes then added to the cells. 
After 16 hr at 21'' C, 5% COj, the cells were harvested, 
pelleted, washed with PBS and lysed with lysis buffer. 

PGR Amplification and Analyses : Genomic DNA was 
amplified for PCR by using PCOj ( 5 ' - TCCTAAGCCAGTGCCAGAAGA- 3 ' 
(SEQ ID N0:11)) and PCOs ( 5 ' -CTATTGGTCTCCTTAAACCTG- 3 ' (SEQ ID 
N0:12)) respectively and Expand Taq polymerase (Boehringer 
Mannheim, Indianapolis, IN) in a 50 fxl reaction at 94° C for 
30 sec, 52.5° C for 30 sec, 72" C for 30 sec for 35 cycles to 
generate a 345 bp fragment. For the 5 locus, the 5' primer 
was 5 ^ -CTCACAAACCTAATGAAACCCTGC-3 ^ (SEQ ID NO:13) and the 3' 
primer was 5 ' -GAAAACAGCCCAAGGGACAG- 3 ' (SEQ ID N0:14) at 94^^ C 
for 30 sec, 59"" C for 30 sec, 72'' C for 30 sec for 35 cycles. 

The PCR product was digested with either JDde J or BSU36I 
restriction endonucleases (New England Biolabs, Beverly, MA) 
and loaded onto 1.2% agarose gel (IX TBE) and 

electrophoresed. The gel was stained for 20 min in 200 ml of 
IX TBE containing 1:20,000 cyber green strain (FMC, Rockland, 
ME) in the dark and quantitated by fluoroimager (Molecular 
Dynamics, Sunnyvale, CA) . The PCR product was spun through a 
Qiaquick PCR purification spin column (Qiagen, Chatsworth, 
CA) in HjO, dried down by vacuum to 5 fxl and the 
concentration determined spectroscopically by O.D at 260 nm. 
The DNA samples (30 /ig) were sequenced directly by an 
automated Applied Biosystems Model 373A DNA sequencing system 
(Applied Biosystems, Foster City, CA) . 

Synthesis and purification of oligonucleotides: The chimeric 
oligonucleotides were synthesized on a 0.2 /imole scale using 
the 10 00 A wide pore CPG on the ABI 3 94 DNA/RNA synthesizer. 
In this construct, the exocyclic amine groups of DNA 
phosphoramidites (Applied Biosystems) are protected with 
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benzoyl for adenine and cytidine and isobutyryl for guanine. 
The 2'-0-methyl RNA phosphoramidi t es (Glen Research, Sterling, 
VA) are protected with phenoxyacetyl group for adenine, 
dimethyl formamidine for guanine and isobutyryl for cytidine. 
After synthesis, the base-protecting groups were removed by 
heating m ethanol : concentrated ammonium hydroxide (1:3) for 
20 h at 55" C. The crude oligonucleotides were purified by 
polyacrylamide gel electrophoresis and the sample was mixed 
with IM urea and 10% glycerol, heated to 70"^ C and loaded on 
a 10% polyacrylamide gel containing 7M urea. After gel 
electrophoresis, DNA bands were visualized by ITv shadowing, 
dissected from the gel, crushed and eluted overnight in TE 
buffer (10 mM Tris-HCl and ImM EDTA, pH 7.5) with shaking. 
The eluent containing gel pieces was spun through 0.45 ptm 
spin filter (Millipore, Bedford, MA) and precipitated with 
ethanol. Samples were further desalted by G-25 spin column 
(Boehringer Mannheim) and greater than 95% of the purified 
oligonucleotides were found to be full length. 

RESULTS: The isolated CD34 enriched population was utilized 
first m an oligonucleotide uptake experiment. The chimeric 
molecule SC2 was mixed with the liposome formulation DOTAP 
under the conditions described above except that a 
radioactive tag was placed at the 5' end of the 

oligonucleotide. Increasing amounts of labeled and unlabelec 
oligonucleotide were incubated with the liposome for 
15 minutes. The mixture was then incubated with cells for 
6 hours after which the cells were washed extensively with 
PBS to reduce nonspecific binding. The cells were then 
centrifuged and the pellet fraction was washed with 0.2 M 
glycine (pH 4.5) to eliminate any remaining nonspecific 
binding. The radioactivity in the cell pellet was determine 
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by scintillation counting. The chimeric oligonucleotide was 
absorbed by the cell in dose -dependent fashion. Since our 
experimental strategy focused on nanomolar concentrations, we 
did not extend the curve beyond 25 nM. Based on the specific 
activity of the radiolabelled chimeric oligonucleotide and 
assuming that each cell is equally receptive to 
transformation, we estimate that up to approximately 50% of 
the CD34* cell population was transfected with the substrate. 
For each experiment, background levels were assessed by 
mixing radiolabelled chimeric molecules with the cells in the 
absence of DOTAP and this level never exceeded 0.05%. 

A population of CD34 enriched cells containing two 
alleles with genotype were transfected with various 
amounts of SC2 and 3 fig/ml of DOTAP. Genomic DNA was 
isolated 16 h after transfection as described above and the 
extent of P'^ to 3^ conversion was measured by restriction 
enzyme polymorphism and by direct DNA sequencing. Genomic 
DNA isolated from 10^ cells was subjected to PGR 
amplification generating a 345 bp fragment by using the two 
primers PCO. and PCOc, . The P'^-specific sequence is cleaved by 
the restriction enzyme Dde I resulting in three fragments of 
192, 108 and 45 base pairs, respectively, while the 0^- 
sequence would be cleaved only once, leaving a 300 bp and a 
45 bp fragment. An increasing level of the uncut 300 bp 
fragment was observed as a function of increasing 
concentrations of SC2 , indicating conversion of the p>^ to 
genotype, Figure 5. A 50% frequency of conversion was 
observed at relatively low concentrations of chimeric 
oligonucleotide (600 ng = 30 nM x 1 ml). In contrast, no 
conversion was observed in cells treated with SCI, a chimeric 
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molecule which pairs to the 3^ site with perfect 
complementarity . 

In order to confirm the DNA sequence change (A to T) m 
the normal cells, direct DNA sequencing of the 345 bp- 
fragment was carried out. The CD34* population containing 
homozygous alleles was transfected with 23 nM SC2 as 
described. Genomic DNA was isolated, PCR-amplif led and the 
samples subjected to automated DNA sequencing. The DNA 
sequence of P'' alone and (3/"^ treated with the SCI both 
contained T. In contrast, the DNA sequence of the 3* cells 
treated with the SC2 showed a dose -dependent conversion of T 
to A at the expected position. The SC2 CMV contains an (a) 
segment which is identical to the coding strand of the 
globin gene. The CMV designated SC5 contained an (a) segment 
identical to a fragment of the non-coding strand of the f?>- 
globin gene. We repeated the transfection experiments 
described above with SC2 and SC5 . The results, presented m 
Figure 5, show that SC5 was active, though not as active as 
SC2 , and was apparently inactive at concentrations below 
2 0 nM . 

Genomic DNA from P'^ cells that had been treated with SC2 
was PCR-amplif ied utilizing the two 6-globin specific 
primers, PCO^ and PCO7 . Only wild- type 6 -globin sequence was 
found, which confirmed that the SC2 CMV is 3 -globin specific. 
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SEQUENCE LISTING 

;i) GENERAL INFORMATION: 

(1) APPLICANT: Kmiec, Eric B. 

(li) TITLE OF INVENTION: CHIMERIC MUTATIONAL VECTORS HAVING 
NON- NATURAL NUCLEOTIDES 

(ill) NUMBER OF SEQUENCES: 14 

(IV) CORRE S PONDENCE ADDRES S : 

£A) ADDRESSEE: Pennie & 

(B) STREET: 1155 Avenue 

(C) CITY: New York 

(D) STATE: New York 

( E ) COUNTRY : U.S.A. 

(F) ZIP: 10036-2711 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compacible 

(C) OPERATING SYSTEM: PC - DOS/MS - DOS 

(D) SOFTVIARE: Patent In Release #1.0, Version #1.30 

(VI) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: On Even Date Herewith 

(C) CLASSIFICATION : 

(IX) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 8 6 9-9741/8864 

(C) TELEX: 66141 PENNIE 



Edmonds 

of the Americas 



(2) INFORMATION FOR SEO ID NO : 1 : 

II) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CCGCCTACAC CCACTCG 

(2) INFORMATICN FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



-41 



wo 97/48714 



PCT/US97/10538 



(11) MOLECUL^E TYPE : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
CCGCCTACGC CCACTCG 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE : DNA 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
CAATGTCCCT GATGTTATGC A 21 
;2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE : DNA 



!X1) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
CGCTGGGCCA AGGACGCT 18 
(2) INFORMATION FOR SEQ ID NO : 5 : 

( 1 ; SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi ) 



SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



CAATGTCCCT GATGTTATGC A 



21 
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(2) INFORMATION FOR SEC ID NO : 6 : 

( 1 ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGCTGGGCCA AGGACGCT 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCCTAAGCCA GTGCCAGAAG A 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CTATTGGTCrr CCTTAAACCT G 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 
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(li) MOLECULE TYPE: DNA 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CTCACAAACT AATGAAACCC TGC 2 3 

(2) INFORMATION FOR SEQ ID NO : 1 0 : 

{ i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GAAAACAGCC CAAGGGACAG 2 0 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TCCTAAGCCA GTGCCAGAAG A 21 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 1 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(XI ) SEQUENCE DESCRIPTION: SEQ ID NO : 1 2 
CTATTGGTCT CCTTAAACCT G 
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(2) INFORMATION FOR SEQ ID NO : 1 3 : 

ix) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(il) MOLECULE TYPE: DNA 



(XI) SEQUENCE DESCRIPTION; SEQ ID NO: 13: 
CTCACAAACC TAA.TGAAACC CTGC 2 4 

(2) INFORMATICS FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D^ TOPOLOGY: linear 

(il) MOLECULE TYPE; DNA 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
GAAAACAGCC CAAGGGACAG 2 0 
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<2) INFORMATION FOR SBQ ID NOiISi 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH! 68 base pairs 

(B) TYPES nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEt DNA 
(Ix) FEATURE: 

(A) NAME/KEY: Chi 

(B) LOCATION: 1 . • . 68 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOtl5: 

AGCGCCGCCT ACGCCCACTC GGCTGTTTTC AGCAGCGUGG GCGTAGGCGG CGCUGCGCGT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
( ix ) FEATURE : 

(A) NAME/KEY: Ch2 

(B) LOCATION: 1 . . • 68 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

hGCGCCGCCr ACACCCACTC GGCTGTTTTC AGCCGAGUGG GTGTAGGCGG CGCUGCGCGT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
( ix) FEATURE: 

(A) NAME/KEY: Ch3 

(B) LOCATION: 1 , . . 68 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCGCGTTTTC GCGCAGCGCC GCCUACGCCC ACUCGGCUGT TTTCAGCCGA GTGGGCGTAG 60 
GCGGCGCT 68 

(2) INFORMATION FOR SEQ ID NO: 18: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 68 base pairs 
<B) TYPE I nucleic acid 

( C ) STRANDEDNESS : a ing le 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 
(Ix) FEATURE: 

(A) NAME/KEY: Dhl 
(B> LOCATION: I . « . 66 
(D) OTHER INFORMATION: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



(2) INFORMATION FOR SEQ ID NO: 19: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
ACCCCCAGCG CCGCCTACAC CCACTCGGCT GACCGG 36 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

(A) NAME/KEY: SCI 

(B) LOCATION: 1 . . . 68 
(D) OTHER INFORMATION: 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

ACCTGACTCC TGAGGAGAAG TCTGCTTTTG CAGACUUCUC CTCAGGAGUC AGGUGCGCGT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NOs21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECUI^E TYPE: DNA 
( ix) FEATURE: 

(A) NAME/KEY: SC2 

(B) LOCATION: 1 * . . 68 
(D) OTHER INFORMATION: 



ACCCCCGCCT ACGCCCACTC GGCTGTTTTC 
TTTCGCGC 



AGCCCAGTGG GCGTAGGCGC CGCTGCGCGT 



60 
68 



(2) INFORMATION FOR SEQ ID NO: 20: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NOi21i 

ACCTCACTCC TGTGGAGAAG TCTGCTTTTG CAGACUUCUC CACAGCACUC AGOUGCOCCT 60 
TTTCGCGC 53 

(2) INFORMATION FOR SEQ ID NO: 22: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6B base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
( ix ) FEATURE : 

(A) NAME /KEY: SC3 

(B) LOCATION: 1 68 

(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATCTGACTCC TGAGGAGAAG ACTGCTTTTG CAGUCUUCUC CTCAGGAGUC AGAUGCGCGT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO: 23: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 
( ix ) FEATURE : 

(A) NAME/KEY: SC4 

(B) LOCATION: 1. . .68 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ACCTGACTCC TGAGGAGAAG ACTGCTTTTG CAGUCUUCUC CTCAGGAGUC AGGUGCGCGT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
( ix ) FEATURE : 

(A) NAME/KEY: SC5 

(B) LOCATION: 1. . .68 
(D) OTHER INFORMATION; 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
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GCCCCTTTTC GCGCACCUGA CUCCTGTGGA GAAGUCUGCT TTTGCAGACT TCTCCACAGG 60 
AGTCACGT 68 

(2) INFORMATION FOR SEQ ID NOt25: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH 1 25 base pairs 

(B) TYPE: nucleic acid 
(C> STRANOEDNESS t single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

(A) NAME/KEY: Delta 

(B) LOCATION: 1...2S 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ATCTGACTCC TGAGGAGAAG ACTGC 25 
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WHAT TS CLAIMED IS: 

1. An ol igonucleobase compound for introducing an alteration 
in a gene of a eukaryotic cell, which comprises: 

a) a first strand of nucleobases having a 3' end and a 5' 
end, which first strand comprises: 

1) a total of at least 15 nucleobases; 

2) at least three nuclease resistant ribo-type 
nucleobases that are selected from the group 
consisting of 2 ' AX-nucleosides , 2 ' AX -nucl eotoids , 
and 2 ' AR-nucleot ides, wherein A=0, F, CI, or Br 
and, when A=0, then X=K or C^.^ alkane and R==C, t> 
alkane, and when A^O, then X and R are om.itted; 
and 

3) at least 3 contiguous ribo-type nucleobases, which 
can be the same as or m addition tc the nuclease 
resistant ribo-type nucleobases; and 

b) a second strand of nucleobases having a 2 ' end and a 
5 ' end, wherein the nucleobases of the second strand 
are Watson-Crick paired to the nucleobases of the 
first strand, 

m which the contiguous ribo-type nucleobases of the first 
strand are Watson-Crick paired to 2 ' -deoxyribo-'ype 
nucleobases and m which at least one ribo-type nucleobase is 
other than a 2'O-methyl substituted nucleotide. 

2. The compound of claim 1, wherein the first strand is a 
region of a ol igonucleobase chain, which chain comprises a 3' 
and a 5 ' terminus , 

3. The compound of claim 2, in which the first strand 
comprises at least two segments of at least 3 contiguous 
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ribo-type nucleobases, which rxbo-type nucieobases are 
Watson-Crick base paired to 2 ' -deoxyribo- type nucleobases. 

4. The compound of claim 3, in which each ribo-type 
nucleobase of the first strand is Watson-Crick paired to a 
deoxyribo- type nucleobase of the second strand, 

5. The compound of claim 4, in which each ribo-type 
nucleobase of the first strand is selected from the group 
consisting of 2 ' AX-nucleosides , 2 ' AX-nucleotoids , and 2'AR- 
nucleotides, wherein A=0, CI, or Br and, when A=0, then 
X=H or C: . alkane and R=C,.^ alkane, and when A^O, then X and 
R are omitted. 

6. The compound of claim 5, which comprises a first chain 
having a first 3' terminus and a first 5' terminus, which 
first chain comprises the first strand, and a second chain, 
having a second 3' terminus and a second 5* terminus, which 
comprises the second strand. 

7. The compound of claim 5, in which for at least one ribo- 
type nucleobase A=F. 

8. The compound of claim 6, 7, in which the first and second 
chains are linked only by Watson-Crick paired nucleobases. 

9. The compound of claim 5, which further comprises a region 
having a sequence of at least 14 contiguous nucleobases or a 
pair of regions having sequences of at least 7 contiguous 
nucleobases wherein each said sequence is the sequence of a 
fragment of a mammalian gene or of its complement. 
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10. The compound of claim 5, which further comprises a 
region having a sequence of at least 14 contiguous 
nucleobases or a pair of regions having sequences of at least 
7 nucleobases wherein each said sequence is the sequence of a 
fragment of a plant gene or of its complement . 

11. The compound of claim 5, in which the first strand 
comprises at least 9 ribo-type nucleobases. 

12. The compound of claim 5, in which the second strand 
contains no ribo-type nucleobases. 

13. The compound of claim 5, which contains no peptide 
nucleobases . 

14. The compound of claim 13, which comprises only 
pentosef uranosyl nucleobases . 

15. The compound of claim 14, which comprises one 
ol igonucleobase chain . 

16. The compound of claim 15, which further comprises: 

a) a hairpin cap attached to the first strand or the 
second strand; and 

b) a linker having a first end and a second end, which 
first end is covalently attached the 3' end of the 
first strand and which second end is covalently 
attached to the 5* end of the second strand. 

17. The compound of claim 16, in which the 5' end of the 
first strand is the 5' terminus. 
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18. The compound of claim 16, in which the 3' end of the 
second strand is the 3' terminus. 

19. The compound of claim 15, which further comprises: 

a) a hairpin cap attached to the first strand or the 
second strand; and 

b) a linker having a first and a second end, which first 
end is covalently attached the 5' end of the first 
strand and which second end is covalently attached to 
the 3' end of the second strand. 

20. The compound of claim 19, m which the 3' end of the 
first strand is the 3' terminus. 

21. The compound of claim 19, in which the 5' end of the 
second strand is the 5* terminus. 

22. The compound of claim 5, in which the first strand 
comprises a first ribo- segment consisting of at least 6 ribo- 
type nucleobases, a second ribo-segment consisting of at 
least 3 ribo-type nucleobases, and an intervening deoxyribo- 
segment consisting of at least 4 2 ' -deoxyribo- type 
nucleobases . 

23. The compound of claim 5, which further comprises a 
protecting group that protects the 3 ' end of the first or 
second strand or the 5* end of the first or second strand. 

24. The compound of claim 5, in which the first strand 
comprises a nuclease resistant ribo-type nucleobase other 
than a 2 ' OMe-ribonucleot ide . 
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25. The compound of claim 5, in which the second scrand 

comprises a 2 ' -deoxyribo-type nucleoside or a peptide 
nucleobase . 

26. A method of isolating a eukaryotic cell having an 
alteration of a target sequence on a chromosome of the cell, 
which alteration causes a selectable phenotypic change of the 
cell, which comprises the steps of: 

a) providing an ol igonucleobase which comprises: 

1) a first strand of nucleobases having a 3' end and 
a 5' end, which first strand comprises: 

1. a total of at least 15 nucleobases; 

ii. at least three nuclease resistant ribo- 

type nucleobases that are selected from 
the group consisting of 2 ' AX-nucleosides , 
2 ' AX-nucleotoids , and 2 ' AR- nucleotides , 
wherein A=0, F, CI, or Br and, when A=0, 
then X=H or ^ alkane and R=Ci.^ alkane, 
and when A^O, then X and R are omitted; 
and 

ill. at least 3 contiguous ribo-type 

nucleobases, which can be the same as or 
in addition to the nuclease resistant 
ribo-type nucleobases , 
in which at least one ribo-type nucleobase is 
other than a 2'O-methyl substituted nucleotide; 

2) a second strand of nucleobases having a 3' end and 
a 5 ' end, 

wherein the nucleobases of the second strand are 
Watson-Crick paired to the nucleobases of the first 
strand, and wherein the contiguous ribo-type 
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nucleobases of the first strand are Watson-Crick 
paired to 2 * -deoxyribo- type nucleobases, which 
oligonucleobase further comprises two homlogous 
regions that are each homologous with a region of the 
target sequence and a mutator region, disposed there 
between; 

b) maxntaining said oligonucelobase within the nucleus of 
the cell; and 

c) selecting a cell having said selectable phenotypic 
change . 

27. The method of claim 26, wherein the mutator region is 6 
or fewer nucleobases. 

28. The method of claim 26, wherein the alteration is a 
deletion of from 6 to one nucleobase. 

29. The method of claim 26, wherein the homologous regions 
together comprise at least 9 and not more that 25 ribo-type 
nucleobases . 

30. A method of introducing an alteration in a target 
sequence on a chromosome of a cultured mammalian cell, which 
comprises the steps of : 

a) providing an oligonucleobase which comprises: 

1) a first strand of nucleobases having a 3' end and 
a 5' end, which first strand comprises: 

i. a total of at least 15 nucleobases; 

ii. at least three nuclease resistant ribo- 
type nucleobases that are selected from 
the group consisting of 2 ' AX-nucleosides , 
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2 ' AX-nucleotoids , and 2 ' AR-nucleocides , 

wherein A=0, F, CI, cr Br and, when A=0, 
O 

then X=H or C^.^ alkane and R=C^ ^ alkane, 
and when A*0, then X and R are omitted; 
and 

iii. at least 3 contiguous ribo-type 

nucleobases, which can be the same as or 
m addition to the nuclease resistant 
ribo-type nucleobases; and 
2) a second strand of nucleobases having a 3' end and 

a 5' end, 

wherein the nucleobases of the second strand are 
Watson-Crick paired to the nucleobases of the first 
strand, and wherein the contiguous ribo-type 
nucleobases of the first strand are Watson-Crick 
paired to 2 ' -deoxyribo-type nucleobases, which 
ol igonucleobase further comprises two homlogous 
regions that are each homologous with a region of the 
target sequence and a mutator region, disposed there 
between; and 

b) maintaining said ol igonucl eobase within the nucleus of 
the cell, 

whereby the alteration is introduced in the target sequence, 
provided that the alteration is other than an alteration that 
produces a selectable phenotype . 

31. The method of claim 30, wherein the mutator region is 
six or fewer nucleobases . 

32. The method of claim 30, wherein the alteration is a 
deletion of from six to one nucleobases. 
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33. The method of claim 30, wherein the homologous regions 
together comprise at least 9 and nor more that 25 ribo-type 
nucleobases . 

34. The method of claim 30, wherein^ the alteration repairs a 
disease-causing mutation in a human cell. 

35. The method of claim 30, wherein the alteration results 
in the inactivation of the gene that contains the target 
sequence . 

36. A method of introducing an alteration in a target 
sequence of the genome of a plant cell, which comprises the 
steps of : 

a) providing an oligonucleobase which comprises: 

1) a first strand of nucleobases having a 3' end and 
a 5' end, which first strand comprises: 
i. a total of at least 15 nucleobases; 

ix. at least three nuclease resistant ribo- 

type nucleobases that are selected from 
the group consisting of 2 ' AX-nucleosides , 
2 ' AX-nucleotoids, and 2 ' AR- nucleotides , 
wherein A=0, F, Cl , or Br and, when A:=0, 
then X=H or C,.f, alkane and R = Ci.6 alkane, 
and when A^O, then X and R are omitted; 
and 

iii. at least 3 contiguous ribo-type 

nucleobases, which can be the same as or 
in addition to the nuclease resistant 
ribo-type nucleobases; and 
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2) a second strand of nucleobases having a 3' end and 
a 5 ' end , 

wherein zhe nucleobases of the second strand are 
Watson-Crick paired to the nucleobases of the fxrst 
strand, and wherein the contiguous ribo-type 
nucleobases of the first strand are Watson-Crick 
paired to 2 • -deoxyribo- type nucleobases, wnich 
oligonucleobase further comprises two homologous 
regions that are each homologous with a region of the 
target sequence and a mutator region, disposed there 
between; and 

b) maintaining said oligonucleobase within the nucleus of 
the cell, 

whereby the alteration is introduced in the target sequence. 

37. The method of claim 36, wherein the mutator region is 
SIX or fewer nucleobases. 



38. The method of claim 36, wherein the alteration is a 
deletion of from six to one nucleobases. 



39. The method of claim 36, wherein the homologous regions 
together comprise at least 9 and not more that 25 ribo-type 
nucleobases . 
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